Performance Analysis of Sequential Element for Low Power Clocking System

ثبت نشده
چکیده

The sequential element (flip-flop) is a basic building block to design any clocking system, which consists of the clock distribution tree and flip-flops. A large portion of the on chip power is consumed by the clocking system the total power consumption of the clocking system depends on both clocking distribution tree and also the register elements (flip-flops). The power consumption of register element is higher than that of the clocking distribution tree the objective is to reduce the power consumption by the register elements (flip-flop). A method of Conditional data mapping Flip Flop (CDMFF) was proposed earlier. The drawbacks of CDMFF are, it uses more number of transistors and it has a floating node on its critical path. Additionally it cannot be used in noise intensive environment. For that a method called Clocked Pair Shared Implicit pulsed Flip Flop (CPSFF) is proposed. In this method the number of transistors is reduced by sharing the clocked pair transistors. The floating node problem is also avoided by using precharge transistors. The design can be implemented in DSCH and MICROWIND 3.1 CMOS Layout tool. The performance is analyzed in views of number of transistors (N), Area (A), power (P), delay (D-Q), power delay product (PDP). Analysis of the performance parameters shows that performance of CPSFF is superior compared to the conventional Flip Flop. Overall power is reduced in CPSFF when compared to the previous method CDMFF. A 20% reduction of power can be achieved in Clocked Pair Shared Flip Flop (CPSFF). In addition due to the absence of floating node problem low swing voltage and dual edge clocking can be easily employed into the proposed register element (flip-flop) to construct clocking system. Key Terms: Flip flop; clocking system; register element; CMOS; delay; low power INTRODUCTION The system on chip design is integrating hundreds of millions of transistors on one chip, whereas packaging and cooling only have a restricted ability to eradicate the excess heat. All of these results in power utilization being the tailback in achieving Umayal.S, International Journal of Computer Science and Mobile Computing, Vol.3 Issue.4, April2014, pg. 175-183 © 2014, IJCSMC All Rights Reserved 176 high performance. The clock system which consists of clock distribution tree and the register elements (Flip Flops and Latches), is one of the most power consuming components in a VLSI system. It accounts for 20% to 40% of the total power dissipation of a system. As a result minimizing the power consumed by register element will have a deep impact on the total power consumed. A large portion of the on chip power is utilized by the clock drives. Carefulness must be paid to reduce the clock load when construct a clocking system. Many current microprocessors selectively use master-slave and pulse triggered Flip Flops. Traditional master-slave single edge Flip Flops for example transmission gated Flip Flops, are constructed by 2 stages, one master and another slave. All of these Flip flops are characterized by hard edge properties such as positive setup time causing huge D to Q delay. On the other hand, pulse triggered Flip Flops diminish the two stages into one stage and are characterized by the soft edge property. 95% of all static timing latching on the Itanium processor is pulsed clocking. Pulse triggered Flip Flops could be classified into two types, implicit pulsed and explicit pulsed, for implicit pulse triggered data close to output Flip Flops (IP-DCO) and explicit pulse triggered data close to output Flip Flops (EP-DCO). METHODOLOGIES FOR LOW POWER DESIGN OF REGISTER ELEMENT Power consumption is determined by several factors including frequency f, supply voltage V, data switching activity , capacitance C, leakage current, and short circuit current. P=PDynamic + Pshort circuit + Pleakage (1) In the above equation, dynamic power Pdynamic is also called the switching power P = CV2f (2) Pshort circuit is the short circuit power which is caused by the finite rise and fall time of input signals, resulting in both the pull up network and pull down network to be ON for a short while. Pshort circuit = Ishort circuit .Vdd (3) Pleakage is the leakage power. With supply voltage scaling down, the threshold voltage also decreases to maintain performance. However, this leads to the exponential growth of the subthreshold leakage current. Subthreshold leakage is the dominant leakage now. Pleakage = Ileakage .Vd (4) A. Dual Edge clocking Using half frequency on the clock distribution network will save approximately half of the power consumption on the clock distribution network. However the flip-flop must be able to be double clock edge triggered. For example, the clock branch shared implicit pulsed flip-flop (CBS-IP DETFF), is a dual edge triggered flip-flop. Dual clock edge triggering method reduces the power by decreasing frequency f in (2). B. Low Swing Voltage on Clock Distribution Tree Using a low swing voltage on the clock distribution network can reduce the clocking power consumption since power is a quadratic function of voltage. To use low swing clock distribution, the flip-flop should be a low swing flipflop. For example, low swing double-edge flip-flop (LSDFF) is a low swing flip-flop. In addition, the level converter flip-flop is a natural candidate to be used in low swing environment too. For example, CDLCFF-ip, could be used as a low swing flip-flop since incoming signals only drive Umayal.S, International Journal of Computer Science and Mobile Computing, Vol.3 Issue.4, April2014, pg. 175-183 © 2014, IJCSMC All Rights Reserved 177 nMOS transistors. The low swing method reduces the power consumption by decreasing voltage in (2). C. Dual Threshold Voltage/MTCMOS Using Dual Vt/MTCMOS to reduce the leakage power in standby mode. With shrinking feature size, the leakage current increases rapidly, the MTMOS technique as well as transistor stacking, dynamic body biasing, and supply voltage ramping could be used to reduce leakage standby power consumption . Split path can reduce the short current power, since pMOS and nMOS are driven by separate signals. D. Reducing Capacity of Clocked Transistors 80% of no clocked nodes have switching activity less than 0.1. This means reducing power of clocked nodes is important since clocked node has 100% activity. One effective way of low power design for clocking system is to reduce clock capacity load by minimizing number of clocked transistor. Any local clock load reduction will also decrease the global power consumption. E. Reduction of Transient Activity There are two ways to reduce the switching activity: conditional operation (eliminate redundant data switching conditional discharge flip-flop (CDFF), conditional capture flip-flop (CCFF)) or clock gating. The conditional operation technique is needed to control the redundant switching. In CDFF, a feedback transistor is inserted on the discharging path of 1st stage which will turn off the discharging path when D keeps 1. Internal node will not be kept discharging at every clock cycle. F. Clock Gating When a certain block is idle, we can disable the clock signal to that block to save power. Both conditional operation and clock gating methods reduce power by decreasing switching activity. DESIGN OF CDMFF (CONDITIONAL DATA MAPPING FLIP FLOP) A large portion of the on-chip power is consumed by the clock drivers. It is desirable to have less clocked load in the system. CDFF and CCFF both have many clocked transistors. For example, CCFF used 14 clocked transistors, and CDFF used 15 clocked loads. In contrast, conditional data mapping flip-flop (CDMFF, Fig1) used only seven clocked transistors, resulting in about 50% reduction in the number of clocked transistors, hence CDMFF used less power than CCFF and CDFF. (Note that CDFF used double edge clocking. This shows the effectiveness of reducing clocked loads numbers to achieve low power. Since CDMFF outperforms CCFF and CDFF in vision of power consumption. Umayal.S, International Journal of Computer Science and Mobile Computing, Vol.3 Issue.4, April2014, pg. 175-183 © 2014, IJCSMC All Rights Reserved 178 Fig. 1. Conditional Data Mapping Flip Flop (CDMFF) However, there is redundant clocking capacitance in CDMFF. When data remains LOW or HIGH, the precharging transistors, P1 and P2, keep switching without useful computation, resulting in redundant clocking. Clearly, it is necessary to reduce redundant power consumption here. Further, CDMFF has a floating node on critical path because its first stage is dynamic. When clock signal CLK changes from LOW to HIGH, CLKDB will stay HIGH for a short while which produces an implicit pulse window for evaluation. During that window, both P1, P2 are off. In addition, if D transits from LOW to HIGH, the pull down network will be disconnected by N3 using data mapping scheme (N6 turns off N3); If D is LOW, the pull down network is disconnected from GND too. Hence internal node X is not connected with supply Vdd or GND. With feature size shrinking, dynamic node is more prone to noise intermission because of the undriven dynamic node. If a close by noise discharges the node X, pMOS transistor P3 will be partially on, and a glitch will appear on output node Q. In a nanoscale circuit, a glitch consumes power and also propagates to the next stage which results the system more weak to noise. Hence, CDMFF could not be used in noise intensive environment. Unlike CDMFF, other dynamic flip-flops employ structure to prevent the floating point. Finally it is hard to apply the low power methods to CDMFF. For example, the clock structure with precharging transistors P1, P2 in CDMFF makes it difficult to apply double edge triggering. Nor can CDMFF be used in a low swing clock environment. (Note that the incoming low swing clock signal cannot drive pMOS, P1 and P2, in high voltage block (VDDH), because the pMOS transistors will not turn off by a low swing voltage, resulting in short circuit power dissipation). Umayal.S, International Journal of Computer Science and Mobile Computing, Vol.3 Issue.4, April2014, pg. 175-183 © 2014, IJCSMC All Rights Reserved 179 DESIGN OF CPSFF (CLOCKED PAIR SHARED FLIP-FLOP BY REDUCING CLOCKED LOADS) Fig. 2. Clocked Pair Shared Flip Flop (CPSFF) CDFF and CCFF use many clocked transistors. CDMFF reduces the number of clocked loads but it has redundant clocking as well as a floating node. To ensure efficient and robust implementation of low power register element, Clocked Pair Shared flip-flop (CPSFF, Fig2) is proposed. It uses less clocked transistors than CDMFF and to defeat the floating problem in CDMFF. In the clocked-pair-shared flip-flop, clocked pair (N3, N4) is shared by first and second stage of the latching part. The pseudo nMOS transistor (i.e) pMOS, P1, is used to charge the internal node X rather than using the two clocked pre charging transistors (P1, P2) in CDMFF. Comparing with CDMFF, a total of three clocked loads are reduced, such that the clock load seen by the clock driver is minimized, resulting in an efficient design. Further the transistor N7 in the clocked inverter in CDMFF is rejected. CPSFF uses four clocked loads rather than seven clocked loads in CDMFF, resulting in just about 40% reduction in number of clocked loads. Additionally the internal node X is connected to supply Vdd by a pseudo nMOS P1, so is not floating, and results improvement of noise robustness of node X. This mitigates the floating point problem in CDMFF. The pseudo nMOS P1 is a weak pMOS transistor. This scheme combines pseudo nMOS with a conditional mapping method where a feedback signals, comp, controls the transistor nMOS N1. When input D stays HIGH, Q=1, N5 is ON, N1 will shut off to avoid the redundant transient activity at node X as well as any short circuit current. Pmos P2 should pull Q high when D switches to 1. The second nMOS branch (N2) is in charge for pulling down the output of Q if D = LOW and Y=1when the clock pulse arrives. pMOS in I1 should turn on Nmos N2 when D=LOW. Although P1 is always ON, short circuit only occurs one time when D makes a transition of LOW>HIGH, and the discharge path is disconnected after two gates delay by comp (turning off N1). After that, if D remains at HIGH, the discharge path is already disconnected by N1; there would be no short circuit. The clocked-pseudo-nMOS scheme is different from the general idea of conventional pseudo-nMOS logic in that we use clocked transistors in the pull down branch. P1, N1, N3, and N4 should be properly scaled to guarantee a correct noise margin. Several low power methodologies can be easily included into the new register element. Unlike CDMFF, low swing clock voltage method is possible for Umayal.S, International Journal of Computer Science and Mobile Computing, Vol.3 Issue.4, April2014, pg. 175-183 © 2014, IJCSMC All Rights Reserved 180 CPSFF since arriving low voltage clock does not drive pMOS transistors. Low swing voltage clock signals could be connected to the nMOS transistors N3 and N4, respectively. Additionally, it is easy to construct dual edge triggering flip-flop based on the simple clocking construction in CPSFF. Further the incoming clock and data signals only drive nMOS transistors. So CPSFF could be used as a level converter flip-flop automatically.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Analysis of Register Element for Low Power Clocking System

The register element (flip-flop) is a basic building block to design any clocking system, which consists of the clock distribution tree and flip-flops. A large portion of the on chip power is consumed by the clocking system the total power consumption of the clocking system depends on both clocking distribution tree and also the register elements (flip-flops). The power consumption of register ...

متن کامل

Study of Static Power of Clocked Pair Shared Flip Flop For Low Power Clocking System

In the past, the major issue of the VLSI designer were area, cost, performance, and reliability; power consideration was mostly of only inferior importance. But over the last few years power in the circuit is the major problem now days which is being faced by the very large scale integration industries. The power dissipation in any circuit is usually take place by the clocking system which incl...

متن کامل

Performance Analysis of Reversible Sequential Circuits Based on Carbon NanoTube Field Effect Transistors (CNTFETs)

This study presents the importance of reversible logic in designing of high performance and low power consumption digital circuits. In our research, the various forms of sequential reversible circuits such as D, T, SR and JK flip-flops are investigated based on carbon nanotube field-effect transistors. All reversible flip-flops are simulated in two voltages, 0.3 and 0.5 Volt. Our results show t...

متن کامل

A Small Printed Quadrifilar Helical Antenna for BGAN/GPS Applications

This paper presents an element of a 2×2-element array antenna for Inmarsat BGAN/GPS applications. The element is an axial mode printed quadrifilar helical antenna that has been integrated with a compact feed network to provide sequential phase rotation for circular polarization (CP) radiation. The novel integrated lumped-element feed network is designed to provide a balanced RF power to the fou...

متن کامل

Experimental and 3D Finite Element Analysis of a Slotless Air-Cored Axial Flux PMSG for Wind Turbine Application

In this research paper, the performance of an air-cored axial flux permanent magnet synchronous generator is evaluated for low speed, direct drive applications using 3D finite element modeling and experimental tests. The structure of the considered machine consists of double rotor and coreless stator, which results in the absence of core losses, reduction of stator weight and elimination of cog...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014